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Data Storage Control Program and Data Storage Control Method 

The invention has an XML engine (103) for receiving data to be 
stored, and a policy control unit {104} which, taking into 
consideration characteristics of this data (degree of popularity 
degree of urgency, degree of importance, etc), looks up pre' 
specified policy data and stores the data in a recording medium 
(a disk, a cache memory) of a prescribed node. 
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Data Storage Control Program and Data Storage Control Method 
Technical Field 

The present invention relates to a data storage control program and to a data 
storage control method for storing data in a recording medium (a disk, a cache 
memory) of a prescribed node in accordance with a prespecified policy. 
Background Art 

Recently, due to increasing use of the Internet, broadband networks, mobile 
telephony, persona! digital (data) assistants (PDAs) and so forth, there has been 
remarkable technical progress relating to electronic government and electronic 
commerce. High performance and high reliability are requirements of systems to be 
used for electronic government and electronic commerce, and such systems also have 
15 to be able to deal with large amounts of data. 

Now if data Is managed In a unitary way by a single server, accesses to data will 
be concentrated on specific data in accordance with the degree of popularity, degree 
of urgency and degree of Importance of the data, and response to users will 

deteriorate. 

Accordingly, with the object of Increasing speed, data has hitherto been stored in 

advance in distributed cache servers so that it can be taken from a cache server in 
response to an access request from a client. 

However, a problem previously encountered when storing data in a cache server 
is that because the data is stored without taking into consideration characteristics of 
the data such as its degree of popularity, degree of urgency and degree of importance 
- in other words, without having a storage policy - the response when an access is 
made Is sometimes poor. 

The present invention has been devised In the light of this situation, and it is an 
object of the Invention to provide a data storage control program and a data storage 
control method that are capable of Improving the response when data Is accessed. 
Disclosure of invention 

In order to attain this object, the present invention is a data storage control 
program for causing a computer to function as a receiving means for receiving data to 

be stored, as a policy look-up means for looking up a prespecified policy after taking 
mto consideration characteristics of the data to be stored, and as a control means for 
storing the data to be stored In a recording medium of a prescribed node, on the basis 
of the look-up result of the policy look-up means. 

The present invention is also characterised in comprising a receiving step for 
receiving data to be stored, a policy look-up step for looking up a prespecified policy 
after takmg into consideration characteristics of the data to be stored, and a control 
step for storing the data to be stored in a recording medium of a prescribed node on 
the basis of the look-up result of the policy look-up step. 
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The invention thus constituted can store data to be stored, in a recording medium 
of a prescribed node, on the basis of a prespecified poiicy after taking into 
consideration characteristics of the data to be stored, and is therefore capable of 
improving the response to an access. 

Brief Description of the Drawings 

FIG. 1 is a block diagram showing the constitution of an embodiment of the 
present invention. FIG. 2 gives an example of the format of policy data 500 used in 
this embodiment. FIG. 3 is a flowchart serving to clarify the basic operation of 
datacentre server 100 shown in FIG. 1. FIG. 4 is a flowchart serving to clarify the 
basic operation of cache servers 300i-300n shown fn FIG. 1. FIG. 5 is a flowchart 
serving to clarify the operation of cache servers 300i-300n shown In FIG. 6 when a 
fault has occurred. FIG. 6 Is a biocl< diagram serving to clarify the operation when a 
fault has occurred in datacentre server 100 shown In FIG. 1. FIG. 7 is a block diagram 
serving to clarify the operation of this embodiment. FIG. 8 is a block diagram showing 
15 the constitution of a modified version of this embodiment. 

Best Mode for Carrying Out the Invention 

A detailed description will now be given, with reference to the drawings of an 
embodiment of the present invention. FIG. l is a block diagram showing the 
constitution of this embodiment. In this figui^, datacentre server 100 Is provided in 
20 node A of network 200 and Is a server for storing data on disk 101 and in cache 
memory 102 on the basis of policy data 500 (see FIG. 2) to be described hereinafter. 

Disk 101 Is a recording medium having the following characteristics: namely 
access speed is slower than that of cache memory 102, but the amount of data that 
can be recorded is larger. Cache memor/ 102 Is a recording medium capable of high- 
speed access, and Is for example a static random access memory (SRAM). 

XML (Extensible Markup Language) engine 103 is for example an XML editor and 
has functions such as editing contents, receiving data to be stored, and checking a 
script (a programming language) contained in this data. Poiicy control unit 104 looks 
up policy data 500 shown in FIG. 2 and performs control which causes the data to be 
stored in a prescribed location (a disk, a cache mennory) of a prescribed node on the 
basis of a policy lon^sponding to the degree of popu^ty, c^PM^iiiiicy ii^ffie ■■■■ 
of Ih^orlance, el;, of the data to be stored. "'" ' ^■'■'■"■-■■■^^'^^ 

Policy data 500 shown in FIG. 2 comprises a group list, policy specification path 

names, and policy functions. The group iist gives the names of nodes in network 200 
and the functions of the servers present at those nodes. In the example shown in FIG 
2, it IS specified that datacentre server 100 present at node A shown in FIG 1 
functions as a datacentre server. 

Similariy, it is specified that cache server 300, in node B functions as a cache 
server and as an alternative datacentre server. An "alternative datacentre server" is a 
backup server which, when a fault has occurred in datacentre server 100, substitutes 
for datacentre server 100 and performs the same functions as datacentre server 100. 

Cache server functions and alternative datacentre server functions are aiso 
specmed for cache server 300, in node G, and cache server functions are specified for 
cache server 300„ located in node I. 
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Tftf potey spedicatton patfi mm^ a pm mme micmm of a data storage 
(ocatfon and serves to specify poN^ fenctfons. For example, in me case of a policy 
specification path name containing "/mma", it ts gpfetJfied that the dati fh q^egtfen is 
scored on a disk. Here, "data stored on a di5l<" is infrequently-updated data with a 
relatively large data size, sucli as voice data and image data. 

In the case of a policy specification path name containing 7index"V::arfe^ped«^ 
that the data :ls stored on a disk and that pointer data that points to this data is 
sS^m iiwrnmrnmry. in the present embodiment, data is stored on both a disic 
and In a cache memory in order to Increase the speed of data retrieval. 

In thie earn of a policy specifieation path nahie containing 7order-cgi", it is 
specified that data is m stored in cache servers 30O3-300, and that it is stomd only 
rn datacentne setveP iOO and oft dlSk 3©1 of aft alternative datacenti% server. This 
data is data which is frequently updated and for which the update order is critical as 

for example data related to ordering a commodity, and It is data regarding which 
service response performance would deteriorate if it were stored in a cache memory 
For this reason, the data in the alternative datacentre server is stored as a backup. 

Returning to FIG. 1, replicator 105 provides functions such as transfer of data to 
a prescribed node via network 200, based on policy data 500. 

Cache server 300i is provided in node B of network 200 and is a server which on 
the basis of policy data 500 (see FIG, 2), stores data on disk 301 and in cache 
memory 302. Policy data 500 shown in FIG. 2 specifies that cache server 300, 
functions as a cache server and as an alternative datacentre server. 

Disk 301 is a recording medium having the following characteristics: namely 
access speed is slower than that of cache memory 302, but the amount of data that 
can be recorded is larger. Cache memory 302 Is a recording medium capable of high- 
speed access, and is for example an SRAM. XML engine 303 has similar functions to 

XML engine 103. 

Policy control unit 304 looks up policy data 500 shown in FIG. 2 and performs 
control which causes the data to be stored in a prescribed location (a disk, a cache 
mmmm-m m basis of«^^-po«cy correspond«g to the degrei of popularity degree of 
yiHi«y.> degree of importance, etc. of the data to be stored. Replicator 305 has 
functions such as receiving data via network 200. 

Cache server 3OO2 Is provided in node G of networl< 200 and has the same 
constitution as cache server 300i. Namely, cache server 300^ is a server which on 
the basis of policy data 500 (see FIG. 2), stores data on a disk and in a cache 
memory (not illustrated). 

Cache server 300^ is provided In node I of network 200 and has the same 
constitution as cache server 300,. Namely, cache server 300„ is a server which on 
the basis of policy data 500 (see FIG. 2), stores data on a disk and In a cache 

memory (not illustrated). 

Client 400 is provided at the user side or at the datacentre side and is operated 
by a general user, a contents editor, a datacentre manager, etc. This client 400 is a 
computer terminal, a mobile telephone terminal, etc. which is capable of accessing 
datacentre server 100 and cache servers 300i-300n via network 200 



Next, a description will be given, with reference to the flowcharts shown in FIG, 3 
and FIG. 4, of the basic operation of this embodfnnent. FIG- 3 is a flowchart serving to 
clarify the basic operation of datacentre server 100 shown In FIG. 1. FIG. 4 is a 
flowchart serving to ctarify the basic operation of cache servers 300i-300n shown in 
5 FIG. 1. cl<^^ iJi^^-' ^ ^ 

In step SAl shown in FIG. 3, XML engine 103 of datacentre server 100 decides, ' 
for exampie^. whether or not data storage for the purpose of maintaining edited data 
has been requested by client 400 operated by a contents editor. If the result of this 
decision is "No", tt repeats the same decision. 

10 In step SBl shown in FIG* 4, XML engine 303 of cache server 300i decides 

whether or not It has received data to be stored from datacentre server 100, If the 
result of this decision is "No", it repeats the same decision. 

When datacentre server 100 receives data to be stored and policy data 500 (see 
FIG. 2) from client 400 operated by a contents editor (or by a datacentre manager), 
15 XML engine 103 issues the decision result ''Yes" at step SAl shown in FIG, 3, 

In step SA2, XML engine 103 checks whether or not predetermined Infected 
scripts are contained in the data in question. Such infected scripts may be regarded 
as computer viruses and have adverse effects such as system shutdown, XML engine 
103 also performs pseudo-execution of scripts contained in the data and Ignores data 
20 relating to infected scripts. 

In step SA3, XML engine 103 decides, by means of the pseudo-execution, 
whether or not an error — in other words, an infected script — Is contained in the 
data, and tf the result of this decision is "Yes^ it hafts the data storage process. 

On the other hand, if the result of the decision at step SA3 is '^No", then in step 
25 SA4 policy control unit 104 looks up policy data 500 (see FIG. 2). In step SA5, policy 
control unit 104 decides, from the group list of policy data 500, whether or not to 
store the data in another node (i.e., in a node other than node A), 

For example, if node B is specified in the group list, policy control unit 104 issues 
the decision result "Yes" in step SA5. In step SA7, replicator 105 transmits the data to 
30 be stored and policy data 500 to cache server 300i corresponding to node B, via 
network 200. 

When the data to be stored and policy data 500 are received by replicator 305 of 
cache server 300i, XML engine 303 issues the decision result "Yes" at step SBl shown 
in FIG. 4, In step SB2, policy control unit 304 stores the data in a prescribed location 
35 (disk 301, cache memory 302} on the basis of policy data 500. 

For example, if "/media" Is specified as the policy specification path name shown 
in FIG. 2, policy control unit 304 stores the data on disk 301, If "/index" is specified 
as the policy specification path name, policy control unit 304 stores data on disk 301 
and also In cache memory 302. 

40 On the other hand, if node A is specified fn the group list of policy data 500 (see 

FIG, 2), policy control unit 104 of datacentre server 100 issues the decision result 
"No" in step SAS shown in FIG. 3, 



In step SA6, policy control unit 104 stores the data in a prescribed location (disk 
101, cache memory 102) on the basis of policy data 500. 

For example, if "/media" Is specified as the policy specification path name shown 
in FIG. 2, policy control unit 104 stores the data on disi< 101. If "/index" is specified 
as the policy specification path name, policy control unit 104 stores data on disk 101 
and also In cache memory 102, 

Next, a description will be given, with reference to FIG. 5 and FIG. 6, of the 
substitution that takes place when a fault has occurred in datacentre server 100. FIG, 
5 is a flowchart serving to clarify the operation of cache servers 300i-300n shown in 
FIG. 6 when a fault has occurred. 

In step SCI shown In FIG. 5, replicator 305 of cache server 300i shown in FIG. 5 
decides, on the basis of a periodic "are you alive?" enquiry sent to datacentre server 
100, whether or not a fault has occurred In datacentre server 100. If a fault has not 
occurred, it issues the decision result "No". In similar manner, the decision made at 
step SCI is carried out in each of cache servers 3002-300n. 

When a fault occurs in datacentre server 100, replicator 305 of cache server 300i 

issues the decision result "Yes" in step SCI, In step SC2, policy control unit 304 of 
cache server 300i looks up the group list (i.e., the functions) of policy data 500 (see 
FIG. 2), decides whether or not It is designated as an alternative datacentre server, 
and if it Is, issues the decision result "Yes", [i]* 

In step SC3, policy control unit 304 of cache server 300i decides whether or not 
the substitution level is the highest level. The "substitution level" is specified in the 
group list of policy data 500 (see FIG. 2, although substitution level is not shown 
there), and indicates the substitution priority ranking when a plurality of cache 
servers have been designated as alternative datacentre servers. 

In the present case, assuming that the substitution level of cache server 300i Is 

the highest level, policy control unit 304 of cache server 300i issues the decision 
result "Yes" in step SC3, In step SC7, policy control unit 304 of cache server 300i 
decides whether or not cache server 300i is operating normally. If the result of this 
decision is "No", then because a fault has occurred, cache server 300i cannot function 
as an alternative datacentre server. 

On the other hand, if the result of the decision in step SC7 is "Yes", then in step 
SC8 policy control unit 304 of cache server 300i transmits a substitution message to 
the other cache servers 3002-300n to the effect that it [2] will function as the 
datacentre server in place of datacentre server 100. 

When a fault occurs In datacentre server 100, the decision result "Yes" in step 

SCI is also issued by cache server 30O2. In step SC2, cache server 30O2 looks up the 
group list (I.e., the functions) of policy data 500 (see FIG. 2), decides whether or not 
it is designated as an alternative datacentre server, and if it is, issues the decision 
result "Yes". 

In step SC3, the other cache servers 3002-300,, decide whether or not the 
substitution level is the highest level. If it is not, the decision result is "No". [3} In step 
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SC4, cache servers 3002-3000 decide whether or not a substitution message has been 
received fronr^ an alternative datacentre server (in this case, cache server 300i). 

In the present example it is assumed that cache servers 3002--300n have received 
a substitution message from an alternative datacentre server (In this case, from cache 
server 300^ and therefore that they issue the decision result "Yes" in step SC4, As a 
result, cache server 300i functions as the datacentre server and cache servers 3002™ 
300n recognise cache server 300i as the datacentre server. 

On the other hand, if the result of the decision at step SC4 is "No", then at step 
SC5 cache servers 3002"300n decide that a fault has occurred in the substituted 
datacentre server as well (in this case, in cache server 300i). 

In step SC6, cache servers 3002--300n decrement the highest level by one and 
then perform the decision of step SC3. As a result, the cache server with the next 
highest priority ranking after cache server 300i — for example, cache server 30O2 — 
functions as the alternative datacentre server. 

Next, a description will be given, with reference to FIG. 7, of the operation in the 
case of accessing data for which the update order is critical. The example used to 

illustrate this is the case where a user purchases goods online. Portions of FIG. 7 that 
correspond to portions of FIG. 1 have the same referencing numerals. Firstly, when 
goods are searched for by client 400 (see FIG, 1) operated by a general user, then as 
shown by the arrowed line marked (1), the data Vindex/lteml.htm,.." stored in cache 
memory 302 of cache server 300n in node I (which functions only as a cache server) 
is accessed. If a commodity is looked up, then as shown by the arrowed line marked 
(2), Vmedia/plcturel.jpg,./* stored on disk 301 of cache server 300n is accessed. In 
this case, because only cache server 300n is accessed, service performance is 
maintained. 

If a commodity is ordered, then as shown by the arrowed line marked (3), cache 
server 300^ is bypassed and Vorder^cgi/itemlxgi,,.;' stored on disk 101 of 
datacentre server 100 is accessed. 

Note that Vorder^cgi/iteml.cgi,..." is backed up in cache server 300i or 30O2 of 
node B or G (which function as alternative datacentre servers). 

As has been described above, according to this embodiment, because data is 
stored in a recording medium (a disk, a cache memory) of a prescribed node on the 
basis of prespecified policy data 500 (see FIG. 2) after taking into consideration 
characteristics of the data to be stored, the response to an access can be Improved. 

Moreover, as was described in connection with step SA2 of FIG. 3, according to 
this embodiment a script (a programming language) contained in the data to be 

stored Is checked, and if this check reveals an error, storage of the data in question is 
halted. It is therefore possible to exclude, in advance, data that would have an 
adverse effect. 

Furthermore, according to this embodiment, if the occurrence of a fault at node A 
(datacentre server 100) Is verified, the functions of node A are performed by the local 
node [4] (for example, by node B). As a result, system-ievel backup is available and 
reliability is Improved. 



In the foregoing, an embodiment of the present invention has been described in 
detail with reference to the drawings. Nevertheless, specific examples of the 

constitution of this invention are not restricted to this embodiment, and design 
modifications and the iike that remain within the spirit of the invention are also 
5 encompassed by the invention. 

For example, the functions described above in the foregoing embodiment may 
also be implemented by recording, on computer-readabie recording medium 700 
shown in FIG. 8, a program for implementing the functions of datacentre server 100 
and cache servers 300^-300,. and by causing this program recorded on recording 
) medium 700 to be read and executed by computer 600. 

Computer 600 comprises central processing unit 610 for executing the above- 
mentioned program, an Input device 620 such as a keyboard or a mouse, a read only 
memory 630 for storing various kinds data, a random access memory 640 for storing 
operating parameters and the like, a reading device 650 for reading the program from 
recording medium 700, an output device 660 such as a display or a printer, and a bus 
670 for connecting the various devices, 

After CPU 610 has read the program recorded on recording medium 700 via 
reading device 650, the functions described above are implemented by running the 
program. Recording medium 700 may be an optical disk, a flexible disk, a hard disk 
etc. ' 

An effect of the present invention is that, because as has been described above it 
stores data to be stored in a recording medium of a prescribed node on the basis of a 
prespeclfied policy, after taking into consideration characteristics of the data to be 
stored, it is capable of improving the response to an access. 

A further effect of the present invention Is that, because It stops storing data to 
be stored when, having checked the programming language contained in this data. It 
IS found that there is an error, it is capable of excluding, in advance, data which 
would have an adverse effect. 

Yet another effect of the present invention derives from the fact that when the 
occurrence of a fault at a certain specific node has been confirmed, the functions of 
that node are Implemented at a local node. As a result, system-level backup is 
available and reliability Is improved. 

Possibility of industrial Utilisation 

As has been described above, the data storage control program and data storage 
control method of the present Invention will be useful for cases where data are to be 
stored on a recording medium such as a disk or a cache memory of a prescribed node 
on the basis of a policy. 
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1. A data storage control program for causing a computer to function as: 
a receiving means for receiving data to be stored; 

a policy lool<-up means for looking up a prespecifled policy after taking into 
consideration characteristics of the data to be stored; and 

a control means for storing said data to be stored in a recording medium of a 
prescribed node, on the basts of the loot<-up result of said policy look-up means, 

2. The data storage control program according to Claim l, characterised in that it 
causes said computer to function as a checking means for checking the programming 
language contained in said data to be stored, wherein said control means discontinues 
the storing of the data in question when the result of checking by said checking 

means is that there is an error. 

3. The data storage control program according to Claim l, characterised in that it 
causes said computer to function as a monitoring means for monitoring the 

occurrence of faults in a prescribed node, and as a backup means for implementing 
the functions of said prescribed node by the local node when the occurrence of a fault 
has been confirmed by said monitoring means. 

4. A data storage control method characterised In that it comprises; 
a receiving step for receiving data to be stored; 

a policy look-up step for looking up a prespecifled policy after taking into 

consideration characteristics of the data to be stored; and 

a control step for storing said data to be stored in a recording medium of a 
prescribed node, on the basis of the look-up result of said policy look-up step. 
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TRANSLATOR'S NOTES 



1. The Japanese sentence does not state explicitly what is designated (or not) as an 
alternative datacentre serve. It is presumably the local cache server which is so designated 
- I.e., cache server 300j in which policy control unit 304 is located. 

2. I.e., cache server 300i. 

3. The description given of this aspea of the invention, i.e., of how the prioritizing subsystem 

operates, seems to me to be very sketchy indeed. i>v.yj.ysicni 

4. Sic The writer appears to be using the term "local node" to indicate the node at which the 
fault in the original datacentre server Is verified. 
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